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Abstract 
This paper proposes a family of estimators of population mean using information on several auxiliary 
variables and analyzes its properties in the presence of measurement errors. 
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1. INTRODUCTION 

The discrepancies between the values exactly obtained on the variables under consideration for sampled 
units and the corresponding true values are termed as measurement errors. In general, standard theory of 
survey sampling assumes that data collected through surveys are often assumed to be free of measurement 
or response errors. In reality such a supposition does not hold true and the data may be contaminated with 
measurement errors due to various reasons; see, e.g., Cochran (1963) and Sukhatme et al (1984). 

One of the major sources of measurement errors in survey is the nature of variables. This may happen in 
case of qualitative variables. Simple examples of such variables are intelligence, preference, specific 
abilities, utility, aggressiveness, tastes, etc. In many sample surveys it is recognized that errors of 
measurement can also arise from the person being interviewed, from the interviewer, from the supervisor or 
leader of a team of interviewers, and from the processor who transmits the information from the recorded 
interview on to the punched cards or tapes that will be analyzed, for instance, see Cochran (1968). Another 
source of measurement error is when the variable is conceptually well defined but observations can be 
obtained on some closely related substitutes termed as proxies or surrogates. Such a situation is 
encountered when one needs to measure the economic status or the level of education of individuals, see 
Salabh (1997) and Sud and Srivastava (2000). In presence of measurement errors, inferences may be 
misleading, see Biemer et a/ (1991), Fuller (1995) and Manisha and Singh (2001). 

Thereis today a great deal of research on measurement errors in surveys. An attempt has been made to 
study the impact of measurement errors on a family of estimators of population mean using multiauxiliary 


information. 
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2. THE SUGGESTED FAMILY OF ESTIMATORS 


Let Y be the study variate and its population mean [1 to be estimated using information on p(>1) auxiliary 


variates X,, Xo, ...,.Xp. Further, let the population mean row vector i = (u, >My. H, ) of the vector 


= (x XX, ). Assume that a simple random sample of size n is drawn from a population, on the 


study character Y and auxiliary characters X;, Xo, ....X». For the sake of simplicity we assume that the 


population is infinite. The recorded fallible measurements are given by 


Yi a¥, +8, 
x, =X, +ny, £=12,--+,p; 
fsl2,.33 

where Yj and Xj are correct values of the characteristics Y and Xj (i=1,2,..., p; j=1,2,..., n). 
For the sake of simplicity in exposition, we assume that the error E;'s are stochastic with mean 'zero' and 
variance Sey and uncorrelated with Y;'s. The errors nj in x are distributed independently of each other 
and of the X; with mean 'zero' and variance ow (i=1,2,....p). Also E;'s and njj's are uncorrelated although 
Y;'s and X;;'s are correlated. 


Define 
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u, = 

H; 
Po [> . 

u = (4,05, iis) (1,1, A ee 
1 n 

VE 2s 
N j=l 
1 n 

x, = Xi 
N j= 


With this background we suggest a family of estimators of Lo as 


i, = gly,u") 


where gy.u") is a function of V,U,,U,,°**,U,, such that 


§ (u.c") = Ho 
> ast) =] 
F | ue") 


and such that it satisfies the following conditions: 


1. The function ol¥, u ") is continuous and bounded in Q. 


(2.1) 


2. The first and second order partial derivatives of the function ol¥, u’ ) exist and are continuous and 


bounded in Q. 


To obtain the mean squared error of [A g> we expand the function a7, ic ) about the point (toe') ina 


second order Taylor's series. We get 


fi, = g(uy.e” )+(9- pS 
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(2.2) 


where 
V* = Uy +0(y- Mo), u*= e+0(u —e), 0<0 <I], ge) 
denote the p element column vector of first partial derivatives of g(-) and g(-) denotes a pxp matrix of 
second partial derivatives of g(-) with respect to u. 
Noting that g({19,e')= [Uo, it can be shown that 
E(i,)= 1) +O") 
(2.3) 


which follows that the bias of [1 g 1S of the order of n”, and hence its contribution to the mean squared 


error of [1 g Will be of the order of ni. 


From (2.2), we have to terms of order n'', 


2 


MSE(\i, )= E(5— 1,)+(u-e)" uer)} 
= El(¥ = py) +2AP- py ue)’ oe") 
+ (eer) w—eXu—e)" (2) 


2 “li (C2 + €2))+ 21672 ner)+(2 wer)" Ale he") 


(2.4) 
where b'=(b,,b.,. : bp), bi=PoiCoC;,(@=1,2, oa oP); 
C=0//Ui, Cor Oj/ Li, (i=1,2, a »P) and Co-Go/Ho, 
Cc + Ci) PrGQC, PpOC, = pi,GC, 
POC, Ce +C% P30,C,; + Pr,CoC, 
A=) pyuGC;. pyc +C%) + p3,0,C, 
L Pirie, P2,C,C, P3,C3C, a C; +CO) | 
The MSE((i ‘ ) at (2.4) is minimized for 
(1) ya ees 1 
Zo '(uoe™) =—WoA 6 
(2.5) 
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Thus the resulting minimum MSE of a is is given by 


min.MSE(i, )= (u2 /n)|C? + Co) ~b" A"b| 


(2.6) 
Now we have established the following theorem. 
Theorem 2.1 = Up to terms of order n”, 
in 2 2 2 Pst 
MSE((i, )> (2 /n Jc? + C2, -b7 4710] 
(2.7) 


with equality holding if 
Oise) = pdb 
So '(Ho.e") = Ho 


It is to be mentioned that the family of estimators u g at (2.1) is very large. The following estimators: 


i=l 


Pp 
=; S°@, =1, [Sahai et al (1980)] 


£ 
= 
if 


L 


p oe: Pp 
egy = T( Bi : @, =1, [Mohanty and Pattanaik (1984)] 
= 


@, =1, [Mohanty and Pattanaik (1984)] 


Ss 
ve _ 
QD 
Il 
<I 
Lone ag 
‘(Mr 
a) 
Ral 
amen 
Ms: 


I 
an 
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a_i} (Hi), Sf d a ee 
Hw, =y Yo, Pr > : Yio, + Yo, ; [Srivastava (1965) and Rao 


and Mudhalkar (1967)] 


ssf, Vet 
ii (12) _ rT ) (a ,'s are suitably constants) [Srivastava (1967)] 


Pp ee 
ie = zit - (2. [Sahai and Rey (1980)] 


P 
eee il i [Walsh (1970)] 
exp {30 logu, [Srivastava (1971)] 
~ (16) — e ; 
UW, = yexp >6,(u, 1) [Srivastava (1971)] 
i=l 


Dp 


p 
we? = po , exp {6, /@,) )log u,}: xo ; =1, [Srivastava (1971)] 


i=l tl 
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etc. may be identified as particular members of the suggested family of estimators [1 igs The MSE of these 


estimators can be obtained from (2.4). 


It is well known that 
V(5) = (u2 /n C2 +2) 
(2.8) 
It follows from (2.6) and (2.8) that the minimum variance of [A Z is no longer than conventional unbiased 


estimator ) . 
On substituting 6) =0, OW =0 Vi=1,2,...,p in the equation (2.4), we obtain the no-measurement error case. 


In that case, the MSE of ul 2 is given by 


MSE(ji, )= ‘Nc; 12 + 2b" g* uct) He *er)) A*(g (1,6) 


= MSE(ii, *) 
(2.9) 
where 
ul oF eee a 
Hi, Hy Ll, 
=¢g * (7,u") 
(2.10) 


and Y and De (i =1,2,---, p) are the sample means of the characteristics Y and X; based on true 
measurements. (Yj,Xj, 1=1,2,...,p; j=1,2,....). The family of estimators u % * at (2.10) is a generalized 
version of Srivastava (1971, 80). 


The MSE of rt 3 * is minimized for 


glist) =- A! by 
(2.11) 


Thus the resulting minimum MSE of ul @ * is given by 
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min.MSE(ji, *)= Mo [e2 ~b'A* p| 
nN 


2 
ie) 
oat ee 


(2.12) 
where A*=[a*;] be a pxp matrix with a*,, = pCjC; and R stands for the multiple correlation coefficient of 
Yon X1,X9,. : Xp. 


From (2.6) and (2.12) the increase in minimum MSE (ii . ) due to measurement errors is 


obtained as 


2 
min.MSE(ji, )~ min.MSE(ji, *)= [Hee +b"A*'b—b" AD] 
nN 
>0 
This is due to the fact that the measurement errors introduce the variances fallible measurements of study 


variate Y and auxiliary variates X;. Hence there is a need to take the contribution of measurement errors 


into account. 


3. BIASES AND MEAN SQUARE ERRORS OF SOME PARTICULAR ESTIMATORS IN THE 


PRESENCE OF MEASUREMENT ERRORS. 


To obtain the bias of the estimator [1 g> We further assume that the third partial derivatives of ol¥, u’ ) 


also exist and are continuous and bounded. Then expanding ol¥, ur ) about the point 


(5, ur ) = (u, : e”) in a third-order Taylor's series we obtain 


ti, = g(u.e7)+(F- p, JEY 
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3 
1 0 0 * 
+ y +(u-—e ae 
Ho bol ( 2} glF".u*") 
(3.1) 
where g''”(t1,e') denotes the matrix of second partial derivatives of ol¥, ut ) at the point 
(7.u")=(u0.e"). 
Noting that 
glue” )= by 
ag) Ly 
DF Vane’ 
dg () =) 
5 = 
PN 
and taking expectation we obtain the bias of the family of estimators [1 gz (0 the first degree of 
approximation, 
abi, = 4] elec" lebvellu-e}+2{* pave 
H, 2 u—e) \§ (119.e") u-—ej+ & (110.e") 
(3.2) 


where b'=(bj,by,... .,bp) with bi=poiCoC;; (i=1,2,..., p). Thus we see that the bias of ul ; depends also upon 


the second order partial derivatives of the function on eV, u’ ) at the point ({1o,e'), and hence will be 
different for different optimum estimators of the family. 
The biases and mean square errors of the estimators [1 Nie =1to18 up to terms of order n' along with 


the values of g(119,e"), g(119,e") and g"(19,e") are given in the Table 3.1. 
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Table 3.1 Biases and mean squared errors of various estimators of LU 
ESTIMATOR g(uo,e") g(u,e") g"(U9,e") BIAS MSE 
— Hy @ 2H) W -@ : 
é : ae ta \(cr Ww -b' 0) [4 c: +Co, — 2b" @+a@' A o 
~ n PXp es n a, ey = 
Mg 
T 2 2 72 2 2 2 
where Woxp=dig(@1,@2,...,@p) where C” = (c + Ci» Co + Cay, + chy) 
Hy @ O a) 2 
“e ~ exp : [Mor [# Je + Cfo) + 2b" 4 o” Aa 
poe (null matrix) ut " ~~ : 
Hy o* 2 [by OF o*" a o*"’Ao b'o* 2 2b’ w* w*' Ao* 
@) TF ae eae ae T Ho |) _~ a Ho || C24? —4+— ~ 
HM, oe ei ee njho'uuo op n ai o'u o'um'o 
where @*! = 
(@1,*02,*...,@p*) with 
(@j,*=@j Hi) 
(i=1,2,...p) 
; = 
al 72 : (4); My CLC 2" @ w Ao 
7 ‘ r 
ae OH (null matrix) Ce UE n ‘ ae Hw o'Lu'o 
— Ly, @ -@ 4 
m a (oo! + W ~ (4) Cc "A@+C* iE ; 2b’ w [4 lc? +Ci -2b' a+@ Aw 
im x oe n ~ ~ 
Mg 


~ (6) = r = 

MU, Hy @ 2,0 @ a ts) lo "Ao-b" _ Mo lc? + Ci, — 2b" w4 o” Aa| 
n i ce a n o a 

4 7) Ly @ r @ fe VT, : 

Mg es (oe -B | fs) o'Aw@-C'W  +2b" o| Mo lc: +C?. +2b" o4 wo” Aa| 
ee eae mr : ; 0 (0) a 

~ (8) Ly @ T Q My \F : 

He One 2u,( oo -¥,,,) x ts) o'Aoa-C'W +b o| Ho lc: Ce + 2b" w4 aes 
n L~ ~ PXxp oS n 2 ~ 
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Table 3.1 Biases and mean squared errors of various estimators of Uo 


ESTIMATOR g'(uo,e") g(uo,e") g"(Uo,e") BIAS MSE 
~ (9) _ = 2 
Hs ee PHONE ap - aa [ctw -6" 0) Ho lc? 42, +26" or 0" Ao 
n ~ pxp © n : 0) ~~ ~ 
~ (10) 2 
Ly @ O Qo 
‘ 0 : - [4 or [# Jk 12, +20" ora" Ao 
n ~ n ne as a 
Bi) Ho 2W Ho 1) 
ms ~ (pxp s 
a mw) Ho T T LU . 
g Fo * 0 Be Ae T r 
Where © (1) =(-O1,-O2,..-5 “Ogy ~Ogtts---4Op) xp ; le Mae b mn | ; |< BCE tee oe a) A o 


0 O 0 
a, 90 0 
0 oa, 0 
0 0 Oo 
0 O 0 
0 O 0 


So 


0] pxp 


C*T=(C74Cay’,. a) 


Cq’+C(qy 3---0) 


55 


2 
0% fy w(aa"—ee,., a ail Ec * Aa=C" +26" | Ho llc? +2, +26" ata” Aa| 
~ (12) 2H) AL : Pe n S Ra 8 
& 
where @ T =( 0} 40l24.+-40p) xp oc =diag(a),d,...,0p) 
a (13) -—a -—@a 2 
g 4 Ho (ae aan : (4) [CT -a"Aa~20" a| Ho |e + Ci, — 2b" a a Ae 
2n pxp~ ~ ~ n Be zs 
~ (14) -a 2U,aa -a : 
g 2 Ho oe ~ [A] 4a—cr -0"a| a Cay 2 ara" Aa 
i i : . es ee 
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Table 3.1 Biases and mean squared errors of various estimators of Lo 


ESTIMATOR g(uo,e") g(o,e") g"(Uo,e") BIAS MSE 
My, (007-0 } 0 
~ (15) a. fie fe v0~ i 2 
g [s]2 "A0-C’®O +26" 6 | eal lc? + Co) + 2b" 04 a” Ao] 
where @ — =diag{1,0,,... 5} 2njl~ — ~ ee 2 n ee 
~ Pxp 
fy 9) Hy O H,00" 0 Ul Ms 
8 : ae : a lo 40426" 6| : lc? + Cf) + 2b" 64 a” Ao] 
2n a" ae xe n ~ ie re 
iy (7) 0 OF nadia 0 : 
H, Mo” re NES if dai cr o*,., +26" 9| a lc: KCl) # 2b" OY a” Ao] 
2n w n (0) os a ns 
where ON 3 =diag {0, (2. ] a(S = } } 
Q, 0, 
* Unbiased 
. < pxp Oy (4 }[e3 $C tli atta® A a] 
~ (18) n s as ad 
& 


where a* ¥ =(0t1#,012%,...50p*) with a =(Gi,Hi, 1=1,2,...,p) 
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4. ESTIMATORS BASED ON ESTIMATED OPTIMUM 

It may be noted that the minimum MSE (2.6) is obtained only when the optimum values of constants 
involved in the estimator, which are functions of the unknown population parameters Lo, b and A, are 
known quite accurately. 

To use such estimators in practice, one has to use some guessed values of the parameters [1o, b and A, either 
through past experience or through a pilot sample survey. Das and Tripathi (1978, sec.3) have illustrated 
that even if the values of the parameters used in the estimator are not exactly equal to their optimum values 
as given by (2.5) but are close enough, the resulting estimator will be better than the conventional unbiased 
estimator ) . For further discussion on this issue, the reader is referred to Murthy (1967), Reddy (1973), 
Srivenkataramana and Tracy (1984) and Sahai and Sahai (1985). 

On the other hand if the experimenter is unable to guess the values of population parameters due to lack of 


experience, it is advisable to replace the unknown population parameters by their consistent estimators. Let 
~ be aconsistent estimator of 6=A’'b. We then replace } by @ and also by by V if necessary, in the 


optimum { g resulting in the estimator Ll g(est)> SAY; which will now be a function of Y , uand >. Thus we 


est)? 


define a family of estimators (based on estimated optimum values) of [1 as 


fies =§ *#(5,u7 6") 
(4.1) 


where g**(-) is a function of (5, ia go" ) such that 


g**(u1,.e7 6" )= My for alll Hy, 


25498 30) w 
V Mu’ 6) 
og” 0 = oat) =—HA'b=—pp 
ou (uo eg" ) ou (uo e) 


(4.2) 


and 
Og * *(. 
eet) ; 
OP Mune 9") 
With these conditions and following Srivastava and Jhajj (1983), it can be shown to the first degree of 
approximation that 


MSE((,;.,.))= min.MSE(/i, ) 


2 
S| aC ba] 
nN 


Thus if the optimum values of constants involved in the estimator are replaced by their consistent 


estimators and conditions (4.2) hold true, the resulting estimator Ll aCeis) will have the same asymptotic 


mean square error, as that of optimum 2 g- Our work needs to be extended and future research will explore 


the computational aspects of the proposed algorithm. 
REFERENCES 


Biermer, P.P., Groves, R.M., Lyberg, L.E., Mathiowetz, N.A. and Sudman, S. (1991): Measurement errors 
in surveys, Wiley, New York. 

Cochran, W. G. (1963): Sampling Techniques, John Wiley, New York. 

Cochran, W.G. (1968): Errors of measurement in statistics, Technometrics, 10(4), 637-666. 

Das, A.K. and Tripathi, T.P. (1978): Use of auxiliary information in estimating the finite population 
variance. Sankhya, C, 40, 139-148. 

Fuller, W.A. (1995): Estimation in the presence of measurement error. International Statistical Review, 63, 
2, 121-147. 

John, S. (1969): On multivariate ratio and product estimators. Biometrika, 533-536. 

Manisha and Singh, R.K. (2001): An estimation of population mean in the presence of measurement errors. 


Jour. Ind. Soc. Agri. Statist., 54 (1), 13-18. 


59 


Mohanty, S. and Pattanaik, L.M. (1984): Alternative multivariate ratio estimators using geometric and 
harmonic means. Jour. Ind. Soc.Agri. Statist., 36, 110-118. 

Murthy, M.N. (1967): Sampling Theory and Methods, Statistical Publishing Society, Calcutta. 

Olkin, I. (1958): Multivariate ratio estimation for finite population. Biometrika, 45, 154-165. 

Rao, P.S.R.S. and Mudholkar, G.S. (1967): Generalized multivariate estimators for the mean of a finite 
population. Jour. Amer. Statist. Assoc. 62, 1009-1012. 

Reddy, V.N. and Rao, T.J. (1977): Modified PPS method of estimation, Sankhya, C, 39, 185-197. 

Reddy, V.N. (1973): On ratio and product methods of estimation. Sankhya, B, 35, 307-316. 

Salabh (1997): Ratio method of estimation in the presence of measurement error, Jour. Ind. Soc. Agri. 
Statist., 52, 150-155. 

Sahai, A. and Ray, S.K. (1980): An efficient estimator using auxiliary information. Metrika, 27, 271-275. 

Sahai, A., Chander, R. and Mathur, A.K. (1980): An alternative multivariate product estimator. Jour. Ind. 
Soc. Agril. Statist., 32, 2, 6-12. 

Sahai, A. and Sahai, A. (1985): On efficient use of auxiliary information. Jour. Statist. Plann. Inference, 12, 
203-212. 

Shukla, G. K. (1966): An alternative multivariate ratio estimate for finite population. Cal. Statist. Assoc. 
Bull., 15, 127-134. 

Singh, M. P. (1967): Multivariate product method of estimation for finite population. Jour. Ind. Soc. Agri. 
Statist., 19, (2) 1-10. 

Srivastava, S.K. (1965): An estimator of the mean of a finite population using several auxiliary characters. 
Jour. Ind. Statist. Assoc., 3, 189-194. 

Srivastava, S.K. (1967): An estimator using auxiliary information in sample surveys. Cal. Statist. Assoc. 
Bull., 16, 121-132. 

Srivastava, S.K. (1971): A generalized estimator for the mean of a finite population using multiauxiliary 
information. Jour. Amer. Statist. Assoc. 66, 404-407. 

Srivastava, S.K. (1980): A class of estimators using auxiliary information in sample surveys. Canad. Jour. 


Statist., 8, 253-254. 


60 


Srivastava, S.K. and Jhajj, H.S. (1983): A class of estimators of the population mean using multi-auxiliary 
information. Cal. Statist. Assoc. Bull., 32, 47-56. 

Srivenkataramana, T. and Tracy, D.S. (1984):: Positive and negative valued auxiliary variates in Surveys. 
Metron, xxx(3-4), 3-13. 

Sud, U.C. and Srivastava, S.K. (2000): Estimation of population mean in repeat surveys in the presence of 
measurement errors. Jour. Ind. Soc. Ag. Statist., 53 (2), 125-133. 

Sukhatme, P.V., Sukhatme, B.V., Sukhatme, S. and Ashok, C. (1984): Sampling theory of surveys with 
applications. Iowa State University Press, USA. 

Tuteja, R.K. and Bahl, Shashi (1991): Multivariate product estimators. Cal. Statist. Assoc. Bull., 42, 109- 
115. 

Tankou, V. and Dharmadlikari, S. (1989): Improvement of ratio-type estimators. Biom. Jour. 31 (7), 795- 
802. 


Walsh, J.E. (1970): Generalization of ratio estimate for population total. Sankhya, A, 32, 99-106. 


61 


